Instruction Wake-Up in Wide Issue Superscalars

نویسندگان

  • Soner Önder
  • Rajiv Gupta
چکیده

While the central window implementation in a superscalar processor is an effective approach to waking up ready instructions, this implementation does not scale to large instruction window sizes as those that are required by wide issue superscalars of the future. We propose a new wake-up algorithm that dynamically associates explicit wake-up lists with executing instructions according to the dependences between instructions. Instead of repeatedly examining a waiting instruction for wake-up till it can be issued, this algorithm identifies and considers for wake-up a fresh subset of waiting instructions from the instruction window in each cycle. This subset of instructions are the ones present in the wake-up lists of completing instructions. The direct wake-up microarchitecture (DWMA) that we present is able to achieve approximately 80%, 75% and 63% of the performance of a central window processor at high issue widths of 8, 16 and 32 respectively when an effective memory disambiguation mechanism [13] is employed for load speculation. In contrast, the wake-up algorithm proposed in constructing the dependence-based microarchitecture (DBMA) [15] can only achieve approximately 62%, 46% and 45% of the performance of a central window processor at issue widths of 8, 16 and 32. Thus the DWMA processor greatly outperforms the DBMA processor.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Optimization of Instruction Fetch for Decision Support Workloads

Instruction fetch bandwidth is feared to be a major limiting factor to the performance of future wide-issue aggressive superscalars. In this paper, we focus on Database applications running Decision Support workloads. We characterize the locality patterns of ia database kernel and find frequently executed paths. Using this information, we propose an algorithm to lay out the basic blocks for imp...

متن کامل

Eecient Use of Processing Transistors for Larger On-chip Storage: Multithreading 1

The current trend of aggressive dynamic scheduling in superscalar processors is reaching a point of diminishing returns. This calls for an architecture that utilizes the processing transistors better, thereby freeing up space to be used for more on-chip storage. Recent proposals have included multiple conventional superscalars on a chip (which we call superchip) and multithreaded superscalars, ...

متن کامل

Multiple Branch and Block Prediction

Accurate branch prediction and instruction fetch prediction of a microprocessor are critical to achieve high performance. For a processor which fetches and executes multiple instructions per cycle, an accurate and high bandwidth instruction fetching mechanism becomes increasingly important to performance. Unfortunately, the relatively small basic block size exhibited in many general-purpose app...

متن کامل

Simultaneous multithreading exploits both instruction - level and thread - level parallelism by issuing instructions from different threads in the same cycle . SIMULTANEOUS MULTITHREADING : A Platform for Next - Generation Processors

s the processor community prepares for a billion transistors on a chip, researchers continue to debate the most effective way to use them. One approach is to add more memory (either cache or primary memory) to the chip, but the performance gain from memory alone is limited. Another approach is to increase the level of systems integration, bringing support functions like graphics accelerators an...

متن کامل

Chip Multiprocessors – A Cost-effective Alternative to Simultaneous Multithreading

In this paper we describe the principles of the chip multiprocessor architecture, overview design alternatives and present some example processors of this type. We discuss the results of several simulations where chip multiprocessor was compared to other advanced processor architectures including superscalars and simultaneous multithreading processors. Although simultaneous multithreading seems...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001